Classification Structuring Tagging

Authors

  • Ramón Aragüés Peleato
  • Jean-Cédric Chappelier
  • Martin Rajman
Abstract

This paper presents an information extraction system that processes the textual content of classified newspaper advertisements in French. The system uses both lexical (words, regular expressions) and contextual information to structure the content of the ads on the basis of predefined thematic forms. The paper first describes the enhanced tagging mechanism used for extraction. A quantitative evaluation of the system is then provided: scores of 99.0% precision/99.8% recall for domain identification and 73% accuracy for information extraction were achieved, on the basis of a comparison with human annotators.

Similar resources

Paper-Centric Interaction Concepts for Collaborative Learning

Field studies show that in many learning settings paper has intrinsic advantages over electronic documents. In this paper we present concepts for the collaborative annotation and structuring of paper documents and digital documents in both distributed and co-located settings. The CoScribe prototype supports the annotation of printed lecture slides and collaborative sharing of annotations. Digit...

SemKey: A Semantic Collaborative Tagging System

By analysing the current structure and the usage patterns of collaborative tagging systems, we can find out many important aspects which still need to be improved. Problems related to synonymy, polysemy, different lexical forms, misspelling errors or alternate spellings, different levels of precision and different kinds of tag-to-resource association cause inconsistencies and reduce the efficien...

A Part-of-Speech Tagging System for the Persian Language

Abstract: Part-of-speech (POS) tagging is an essential task for many models and methods in other areas of natural language processing, such as machine translation, spell checking, text-to-speech, automatic speech recognition, etc. So far, highly accurate POS taggers have been created for many languages. In this paper, we focus on POS tagging in the Persian language. Because of problems in Persian POS t...

The Annotators’ Perspective on Co-authoring with Structured Annotations

In asynchronous collaborative writing, annotations play an important role as a communication medium among co-authors. Research has shown that grouping related annotations together can help those who review an annotated document by reducing their workload and raising the accuracy of their reviewing. Less is known about the impact on users who create such structured annotations...

Hierarchic Texture Classification Using Morphological Gradients and Genetic Algorithms

A novel method for adaptively selecting texture features is presented. We use a genetic algorithm to search for an optimal set of structuring elements which provides the best discrimination of textures. Moreover, a tree structure containing the selected set of structuring elements has been set up for classification. Experiments show that the proposed method can achieve high classification accu...

Publication date: 2000